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Abstract 

Let X — {Xn\n>i and Y = {i^n}n>i be two independent random sequences. We 
obtain rates of convergence to the normal law of randomly weighted self-normalized 

sums 

V'„(X,Y) = ^X,y,/K, Vn = ^Y^ + ---+Yl 

These rates are seen to hold for the convergence of a number of important statistics, 
such as for instance Student's i-statistic or the empirical correlation coefficient. 

1 Introduction 

Let X = {Xn}n>i and Y = {Yn}n>i be two random sequences. In this paper we in- 
vestigate the rate of convergence to the normal distribution of the randomly weighted 
self-normalized sums 

" I 

i^n = M^, Y) = ^ XiYi/Vn, Vn = ^Y^ + ---+ (1) 

i=l 

The random variables ipn appear in some important statistics. For example, when 
testing the null that the mean of a population Y is equal to 0, one uses the Student 
t-statistic 

VEY 

Denoting by ipn = V'n(l) Y) = ^27=1 ^i/^n the usual self-normalized partial sums, one can 
easily see that 

T„ = ^„[(n-l)/(n-^2)]i/2^ 

so that and Tn are equivalent (in terms of a 1:1 correspondence). See, e.g., Efron [TO] , 
Logan et al. [15] or Gine et al. [12j for a discussion. 

More generally, we could phrase the above testing problem as Hq : /3 = versus 
Hi : /3 7^ in the linear model Zi = j3Xi + 1^. (The setup Xi = 1 for all 1 < z < n is 
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contained as a special case.) Then '0„(X, Z), which reduces under Hq to Vn(X,Y), will 
serve as a natural test statistic. As a matter of fact our research was originally motivated 
by this problem (see Hallin et al. [13j). We were interested in obtaining asymptotic 
normality of this test under as general as possible assumptions on the errors YJ. 

Another related example where V'n appears is the empirical correlation coefficient. If 
the sequences X and Y are centered then the empirical correlation is 

p„(X, Y) = M^,Y)/Br, = ^'='f/^\ Bn = Jxf + ... + Xl 

We will see how, under moment conditions on X, convergence rates for tpn can be transfered 
to pn (see Lemma [22] below). 

Besides their statistical applications, self-normalized sums have proven to be chal- 
lenging mathematical objects with interesting properties. As a consequence they have 
attracted considerable attention in probability theory. For example Logan et al. [15] stud- 
ied the limiting distributions of ipni^,^) when Y is a centered i.i.d. sequence with heavy 
tails, and conjectured that Vnll, Y) is asymptotically normal if and only if Y is in the 
domain of attraction of the normal law. Gine et al. |12j proved that this conjecture holds 
true, while Chistyakov and Gotze [9] settled the question of the convergence of Student's 
statistic by giving necessary and sufficient conditions for these sums to allow limiting dis- 
tributions which are not concentrated on {±1}. More recently Benktus et al. |3j studied 
the limiting distribution of the non-central t-statistic under different assumptions on Y; 
they show, inter alia, how this limiting distribution depends critically on the existence of 
fourth moments for the 1^. For a comprehensive study of these and related questions we 
refer the reader to the book Lai et al. |14j . 

In a slightly different setup, Breiman [6] provides necessary and sufficient condi- 
tions for the weak convergence of randomly weighted self-normalized sums of the form 
^jXili/^^yi. Mason and Zinn fT6] settle several questions left open by Breiman [6], 
and deduce the asymptotic distribution of V'n(l5 Y) in the case of symmetry. 

In this paper we will be interested in the rate of convergence to the normal distribution 
of V'n(X, Y) as well as of p„(X, Y). The case when X = 1 and {Yi} are independent with 
finite variance is already well established. Bentkus et al. p| give sharp rates for convergence 
of Student's statistic, and thus equivalently for V'n(l)Y), in the non-i.i.d. case. Explicit 
constants in these bounds were derived by Shao [18]. See also Bentkus and Gotze [4] for 
further references. There seem to be no similar investigations for ^/^^(Xi Y). To the best 
of our knowledge, no similar results for the convergence rate of the correlation coefficient 
/9„(X,Y) exist. 

Our approach is as follows. We first state in Lemma 12.11 a general result, which is 
simple to prove and which provides a bound that holds without any hypothesis on Y, be 
it on its moments or dependence structure. The main target is then to work out the thus 
obtained rates explicitly by imposing different assumptions on the sequence {Yi}. This is 
done through a number of subsequent results. An interesting feature in our approach is 
that (with one exception) we do not work with truncation arguments, even when assuming 
an infinite variance for the Y's. 
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2 Results 



Recall that if P and Q are any probability measures on the real line, then the Wasserstein 
distance is given by 

dw(.P,Q) = sup / hdP - I hdQ , 
hen J J 

where Ti is the class of Lipschitz 1 functions, i.e. Ti = {h : M — ?■ M; ||/i'|| < 1} with 
11/11 = sup^jgR The Kolmogorov distance dxiP^Q) is defined similarly, with 7i 

replaced by the class of indicator functions /i^ (•) = /{•< z}, z E M. If 1/ and S are random 
variables on the space {Q, A, P) then dwiV, S) will be written for dw{P o V^'^jP o S~^), 
where P o is the image measure of V under P. Similar is the definition for dxiV, S). 
Throughout Z stands for a standard normal random variable and we are interested in 

(ivi/('(/'n(X, Y),Z) and (iH/(p„(X, Y), Z) , 

and 

dif(V'n(X, Y),Z) and (iii-(p„(X, Y), Z) , 

under the assumption that X and Y are independent. 

The following simple Lemma gives the first step in our approach. 

Lemma 2.1. Let V'n(X, Y) be defined as in where X and^ are two mutually indepen- 
dent sequences. Assume that {X/^} is i.i.d. with EXi = 0, EXf = 1, ^3 = i^jXip < 00. 
Then 

di^(V^„(X,Y),Z) <0.566A, (2) 

where 

n 

A = ^^|5fc,„|3 with 5u,n = Yk/Vn. (3) 
k=l 

Furthermore 

diy(^„(X,Y),Z) <6A. (4) 

Proof of Lemma \2A[ We show first 1^. Let Y„ = [Yi,. . . , Yn), -Fra(yn) be the joint law 
of Y„ and set = Ya^ivI- Then using a version of the Berry-Esseen theorem for 
independent random variables, we obtain for any z G M 



P(Z<z)-P(^„(X,Y) <z)| 

< / \P{Z <z)-P{M^,yn)<z)\dFn{yn) 

„ n 

< CE\Xi\^ / y^{y;/vnfdFn{yn) = C^^EA. 

JR" ~1 



[ P{Z <Z)- P{M^, Y) < z\Yn = yn)dFn(yn) 

JRi 



By a recent result of Shevtsova [19J, C < 0.56. 

The proof of dH) can be done in the exact same way, using Corollary 4.2 in [8]. 

□ 

We remark that in Lemma l2.ll we do not put any restrictions on the sequence {Y^}. 
This means that, in theory, we can obtain non-trivial bounds even if this sequence is not 
independent or identically distributed. Of course, the difficulty then resides in working 
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out A explicitly, which we do under different assumptions in Theorems 12. 1|, 12.21 and 12.31 
below. We will see that Lemma |2 . 1 1 provides optimal bounds in several special cases. 

Let us consider first the following special case, which gives an application to self- 
normalized sums ipni^,^) when the Yi are not necessarily independent nor identically 
distributed. We assume instead that 

{Yi,...,Yn) = {±Yu...,±Yn) (5) 

for all choices of +, — . This form of symmetry, known as sign- symmetry, is more general 
than spherical symmetry (see e.g. Serfling [17j ) and is to be likened with the concept 
of orthant symmetry discussed by Efron [lO]. Sign-symmetry is obviously satisfied if the 
Yi are symmetric and independent random variables. Under this condition the following 
result (which should be also compared to Mason and Zinn [16\ Corollary 6]) holds. 

Corollary 2.1. Assume that ([5]) holds and set Sn = X^fc=i^fc- Then 

dw{Sn/Vn,Z) < A and dK{Sn/Vn,Z) < 0.56 A, 

TOi/i a = eLi^I'^mI'- 

The proof follows simply by applying Lemma [2.1l to ipnO^, Y) with {X^} i.i.d. Rademacher 
variables, i.e. Xj. = ±1 with probability 1/2. Then due to the symmetric distribution of 
the Yfc, we have that Sn/Vn and '0„(X, Y) have the same distribution. 

The next Lemma gives a simple criterion for switching from ?/;„(X, Y) to p„(X, Y). 
While we impose 4 moments for Xi, we keep the assumptions on Yi general. 

Lemma 2.2. Let the assumptions of Lemma \2. 1\ hold and assume in addition that 771,4 := 
EXf < 00. Then, ifn/logn > 87774, 



dw{M^,^),V^Pn{^,Y)) < 

Let us consider once more the testing problem Hq : /3 = versus Hi : /3 7^ in the 
linear model Zi = j3Xi + Yi. The previous lemma in connection with Lemma 1 2. II shows, 
if the regressors Xi are centered (a condition which is convenient but could be modified) 
and have 4 moments then we get under Hq for very general errors Yi the convergence of 
the correlation test statistic \/np„(X, Z) to the normal, with an approximation error of 
order 0(n~^/2 + A). 

When {Yfc} is a stationary sequence, then A = nE\6i^n\^ and obtaining a rate of 
convergence to the normal distribution is entirely reduced to calculating the third absolute 
moment of Si^n- We now concentrate on obtaining A under different moment and tail 
assumptions on the sequence {Y^} under the i.i.d. setup. We first work out A under the 
sole assumption iiJlYilP < 00, p £ (2,3]. In this case we obtain the "usual" convergence 
rates. 

Theorem 2.1. Let {Yi} be an i.i.d. sequence, let p € (2,3] and assume EY^ = 1 and 
E\Yi\P < 00. Then 

A = r7^|^i,„|^ < nE\Si,nf ~ ^1^1^77^-^/^. (6) 
(Note that the first inequality in ^ follows from |(5i,n| < !•) 
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Remark 2.1. A look at the proof of Theorem \2.1\ suggests that similar results may he 
obtained under different dependence conditions, too. In fact, besides some purely analytic 
estimates, which hold for any sequence {Yk}, we only make use of moment inequalities 
which exist in different generality for many weak dependence and mixing concepts, respec- 
tively. 

Next we consider the case when we have knowledge on the tail probabilities of the 1^. 
Let T{p) denote Euler's gamma function. 

Theorem 2.2. Let {Yi} be an i.i.d. sequence, let I < a < 2 and PiY^ > a;) ~ l[x)x~°^ , 
where i{x) is slowly varying at oo. If ay '■= EY^ < oo, then we have for any 7 > a 

a^r(7) 

Example 2.1. Consider the case P{\Yi\p > x) ~ P e (2,3). Then E\Yif = 00 

for any f3 > p, while E\Yi\P < 00. Applying the above result with 7 = 3/2 we obtain 



A = nE \6i^nf ~ - vv" -^^^ Yo/9A ' 

apL [6/2) log n 

Hence the additional knowledge of the tail behavior yields a slightly better rate than the 
one obtained in ([6]). 

We now turn to the case when we have infinite second moments. 

Theorem 2.3. Let {Yi} be an i.i.d. sequence, let P{Y^ > x) ~ i{x)x~^ , with i{x) slowly 
varying at 00. If EiY^) = 00, then, for any ^ > \, we have 

7 - 1 L(a„) 

where L{x) = (^£{t)/t)dt and {a„} is a sequence satisfying a„ ~ nL{an)- 

Example 2.2. Assume that P{Y^ > x) ~ x^"'^(log x)^^. Then EY^ < 00 and by Theo- 
rem \2.2\ we get that for all n > 1 



r((3-p)/2)r(i+p/2) 



di^(V'„(X,Y),Z) <^(log 



n 



-2 



for some large enough constant A. If P{Y^ > x) ~ x ^(logx) then EY^ = 00. But 
since £{x) = log log x and a„ ~ n log log n we still get by Theorem \2.3\ applied with 7 = 3/2 



n) 



dK{M^,^),Z) < A ((log n) log log 

and thus an explicit convergence rate to the normal law. 

We conclude with a result which shows that, even in the case when Yi is in the domain 
of attraction of an a-stable law with a strictly less but close to 2 we can get non-trivial 
bounds. 

Theorem 2.4. Let {Yi} be an i.i.d. sequence, assume that P{\Yi\ > x) ~ ^(x)x~" with 
a G (0,2). Then for any 7 > 1 

TP\x |27 r(7-a/2) 

"^'^^•"'^ r(7)r(i-a/2) - 
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Example 2.3. We apply this result with 7 = 3/2. Let a = 2—e for small e > 0. Observing 
that under the above assumptions r(3/2 — a/2)/r(3/2) < 2, and l/r(e) ~ e for e ^ we 
get by Lemma \2.1\ that for large enough n 

d;^(Vn(X,Y),Z) < 0.56^3 e. 

When the distribution of the Yk is symmetric, we can conclude that for sufficiently large 
sample size n we have dxiSn/Vn, Z) < 0.56 e. 



3 Proofs 



In the sequel we need the following version of Hoeffding's inequality (see e.g. Shao \18\ p. 
145]). 

Lemma 3.1. Let {Zi, 1 < i < n} be independent non-negative random variables with 



P Zi<xj< exp 



{lJ--xf 
2a2 



Proof of Lemma \2.S\ . Let 

(f) := V^/On(X, Y). 

Note that (f) = ^/-^V'- Then, for all /i G "H, by the mean value theorem we get (recall that 

WW < 1) 

\Eh{(l)) - Ehiijj)\ < E 



Let e G (0, 1). Then the last term is bounded by \An^ + An \ where 



^1^) := E 



n 

n 



and 



4^) := E 

Define K4 = E{Xl - if. Then 




m{Bl<{\-e)n) 



1^1 



< 



1=1 



1/2 



1 /7t4 



1 — £ V n 



(2) 



For estimating A)n we use 



n 



i=l 



< 



Ki=l 



1/2 . „ 



Br, 
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Thus 

< E\^ - Bn\I{Bl < (1 - e)n} < ^P{bI < (1 - e)n). 

By Lemma [3.1l we get that P{B^ < (1— e)n) < exp(— e^n/(2m4)). Collecting our estimates 
we have 

\Eh{(l)) - Eh{'^)\ < Vnexp(-e2n/(2m4)). 
For large enough n we have := (2m4 log n)/n < 1/4, and since = K4 + 1 we conclude 



n vn V n 

□ 

Proof of TheoremlMl Let Yi^n = >i A n^/^. Then 

£;y.2^ = 1 - e„ with En^O. (8) 

Further 

pn^^P poo 

EY^^ = / a;^dP(yi < x) < n^/P / ^^^^(yi < x) = n^/P. (9) 
JO JO 

Now fix an arbitrarily small e > and let n be large enough in order to have e„ < e/2. 
Using Lemma 13.11 with ([8]) and ([9]) it follows that 



P {Yi^n + ■■■ + <{l-e){n- 1)) < exp (-^(n - if-^/^^ . 



(10) 



Next we observe that 



<^{% + ---+^;'n<(i-eM 

1 \P/2 



This and (flOl) give 

n£;|,5i,„|f = nE 



z^fc=i -"a 



< nP(% + • • • + Y^r, < (1 - e)n) + i^jyi^ (^^j n^-^/^ 

/ 1 \ P/2 

r.. E\Yi\P [ n^"P/2 (n — )• 00). (11) 
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On the other hand we have 

V2 \ P/2 



L-p/2 

^ ^ \ fc=2 / 

~^|yi|V-P/2 (n^oo), (12) 

where in the last step we used the law of large numbers to obtain Sfc=2 ^ 1) ~^ 1- 
Now (jlip holds for arbitrarily small e > 0. Together with ()12p the proof follows. □ 



Proof of Theorem \2.3[ We borrow an idea of Albrecher and Teugels [Ij. The crucial trick 
is to write 

s^-^e-'^'ds, 7 > 0. 



1 _ 1 

off ~ r(7) 

Then, since 



Z^A:=1 -"fc 



we obtain 

with i^i(s) := E[e~'^^i) and 9^2(5) := E (jY^]"'e~^'^i^ . Choosing a„ — ;> 00 such that 
na^^L{an) — )■ 1 one easily shows (see [T]) that for any s > 

lim (^"-1 f — ^ = e-'. (13) 

n-5>oo \CLnJ 

In order to determine '/32(s) we introduce 

r^^,{x)=[ t^e'^Ut, 7 > 1, s,a;>0. 

JO 

Note that lim^^^oo ^p,s{x) = s~'''^"*""^^r(7 + 1). Further we let F be the distribution function 
of Y^. Using integration by parts, we get 

poo pOG 

/ F{x)dr^^six) = s-(^+^)r(7 + 1) - / r^,s(x)(iF(x). 

Jo Jo 
A simple consequence is that 

poo poo poo 

/ r^,,(x)dF(x) = / (1 -F(x))dr^,,(x) = / x^e-'^'il- F{x))dx. 
Jo ' Jo ' Jo 

Since 7r^_i^s(x) — sr^^s(x) = x'^e~^^ we conclude that 

;>oo 

(^2(5) = / x^e-"^dF(x) 

/"OO /"OO 

= 7 / r-y_i,s(x)dF(x) - s / T^^s{x)dF{x) 
Jo Jo 

poo poo 

= 7/ x^-\l- F{x))e-'^dx- s x^{l-F{x))e-'''dx. 
Jo Jo 
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By our assumption 1 — F{x) ~ x ^i{x) and thus by Karamata's Tauber theorem (see e.g. 
Bingham et al. [5l Theorem 1.7.6]) we have for any p > —1 

/•CO 

/ xP{l- Fix))e-'''dx ^T{p)s-feil/s) as s ^ 0. 
Jo 

Combining with our just derived formula for ^2{s) we have for 7 > 1 

^2(^)~^^r(7-l) ass^O. (14) 
Finally consider the quantity 

n f'°° 

nE\6lJ'' = ^ t''-'ipr\t)Mt)dt. 

It is easy to show that n f~^Lp^~^{t)(p2{t)dt — )• for all e > 0. Hence we can restrict 
the integration to the compact interval [0, e], on which Lemma 5.1 of Fuchs et al. |11] 
can be used to uniformly bound the integrand above by an integrable function. Using the 
already defined On, we therefore get from (llSh . (1140 and dominated convergence 

f<~\l-\t/an)v2{t/an)dt 



r(7) \anj Jo 
r(7 - 1) nijan) 
r(7) an 



as n — )• 00. 



The relation "^^"""^ ~ tt-H concludes the proof. □ 

The proof of Theorem 12.21 is similar to the proof of Theorem 12.31 and will therefore be 
omitted. 
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